-
Notifications
You must be signed in to change notification settings - Fork 224
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[HOTFIX][MI100][OCL] Remove unsed kernel arg for wrw fp32 NCHW #1013
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for quick response. Constness of buffers also needs to be fixed, see #1012 (comment).
What about regression tests?
I think this is not ok, #1012 (comment) |
Or course, not. But this PR should add some regression tests that should prevent this bug in the future (see #1012). If you need assistance, please let me know and I'll add them to this branch. |
@atamazov ok sure, I think need your help on adding some tests in this branch, in case I'm not quite clear of the regression of this PR. |
This comment has been minimized.
This comment has been minimized.
Done in b6a3ff7. I would like to make this commit absolutely clear for you before merging this. Please review this commit and compare it with the description of #1012. Ask if something is unclear. |
@atamazov the flag |
…est during Smoke test stages.
@carlushuang Yes, we do not care about precision in this test. Extra kernel arg will cause the test to fail. |
We didn't have a MI100 test for OpenCL backend, so the test weren't run. dca7901 resolves this. |
# RESOLVED Conflicts: # test/CMakeLists.txt
set(ENVS_REGRESSION_ISSUE_1012 | ||
MIOPEN_DEBUG_IMPLICIT_GEMM_FIND_ALL_SOLUTIONS=1 | ||
MIOPEN_FIND_MODE=normal) | ||
|
||
set(ARGS_REGRESSION_ISSUE_1012 | ||
--verbose | ||
--disable-forward | ||
--disable-backward-data | ||
--disable-validation) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
⚓
* Remove unused kernel arg for wrw fp32 xdlops NCHW * ConvAsmImplicitGemmGTCDynamicWrwXdlops: fix metadata in igemm_wrw_gtc_gfx908.s * Add regression tests * Re-target OpenCL smoke tests from Vega to gfx908. Enable regression test during Smoke test stages. Co-authored-by: Artem Tamazov <artem.tamazov@gmail.com>
This resolve issue #1012, when under fp32 wrw, it seems feed in more kernel arg than needed. This seems OK under HIP, but in OCL it will report issue. But anyway we need keep consistent kernel arg with metadata declaration.
@shaojiewang please double check this fp32 wrw